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This paper is concerned with the estimation of the period of an unknown periodic function 
in Gaussian white noise. A class of estimators of the period is constructed by means of a 
penalized maximum likehhood method. A second-order asymptotic expansion of the risk of these 
estimators is obtained. Moreover, the minimax problem for the second-order term is studied and 
an estimator of the preceding class is shown to be second order efficient. 
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1. Introduction 

The framework. Let a; be a random process defined by the equation 

dx{t) ^ f{t/e)dt + dW{t), te[-T/2,T/2],e>0, T>0, (1) 

where / is an unknown real periodic function with period 1, is a period parameter 
which we seek to estimate and W{t) is the standard Brownian motion on [— r/2,T/2]. It 
is assumed that x{t) is observed continuously on [— T/2,r/2] and that 9 belongs to an 
interval Q = [axjI^T]; where q;t,/3t are reals such that < ax < (3t < +oo. Throughout 
the paper, the asymptotic framework T— > +oo is considered. 

Motivation. When / is known, under smoothness and identifiability conditions, the 
maximum likelihood estimator is asymptotically efficient; see Ibragimov and Has'minskii 
[12], Theorem 5.1, page 203. Here, we are interested in the semi-parametric problem: the 
parameter 9 must be estimated, but / is an infinite-dimensional nuisance parameter. In 
a seminal paper, Golubev [9] gives an asymptotically efficient estimator of the period in 
this framework. In a recent article, Gassiat and Levy-Leduc [7] obtain an asymptotically 
efficient estimator of the period in a discretized version of (1) and address the problem 
of estimation of multiple periods when the signal is a sum of different periodic functions. 

From a practical point of view, the problem of period estimation arises in many different 
areas, such as communications, seismic signal processing and laser vibrometry (see Prenat 
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[16]). Model (1) can be seen as the signal observed by a receptor situated at some distance 
from a vibrating source. The frequency of the source vibration equals and the noise 
dTy(t) stands for the degradation of the original signal due to the distance between 
the source and receptor. For other practical applications and further references on the 
subject, we refer to Gassiat and Levy-Leduc [7] and Lavielle and Levy-Leduc [14]. In 
the latter, the authors propose a practical method for estimating the frequency in the 
discrete-time model cited above, when the number of observations at hand is fixed. 
Semi-parametric estimators and second- order efficiency . We now return to the general 
semi-parametric framework. The aim is to estimate the parameter without knowing the 
nuisance function / and, if possible, to obtain the same optimal asymptotic variance for 
the estimator as in the parametric framework (sec van dor Vaart [18], Chapter 25, or 
Bickel, Klaassen and Wellner [1]). If this is possible, which is the case in model (1), we 
say that there is no information loss. 

However, for a given model, there is often a large choice of asymptotically efficient esti- 
mators. This motivates the study of the second order term for the estimator's quadratic 
risk. This problem was studied for partial linear models by Golubev and Hardle [10]. 
In this paper, the authors construct second-order efficient estimators when the nuisance 
function belongs to a known functional class. Golubev and Hardle [11] give nonparamet- 
ric adaptive versions of their estimators. For essentially nonlinear models, the first result 
on semi-parametric second-order efficiency was established by Dalalyan, Golubev and 
Tsybakov [6]. In this paper, the translation model a; (t) = f{t — 6) + en{t), tE [—1/2,1/2], 
is considered and the authors study, as e tends to zero, the asymptotic second-order 
properties of a class of penalized maximum likelihood estimators. Second-order adaptive 
versions of these results are obtained by Dalalyan [5] . 

As noted by Dalalyan et al. [6], the study of this problem is of particular interest since, 
asymptotically, the second-order terms in a semi-parametric setting are often not much 
smaller than the first-order terms. Typically, as we shall see for model (1), when the 
first-order term is T~^/^, the second-order term can be of the order of T~^/^ x T~^/^. 
Objective and results. The goal of this paper is to investigate the second-order efficiency 
in the semi-parametric problem of period estimation. One of the key tools in the paper 
by Dalalyan et al. [6] is the fact that the model can be projected onto a basis and thus 
described as a discrete collection of independent submodels. As we shall see in the sequel, 
this property does not hold in our framework. Thus we must work globally on the whole 
model and hence introduce appropriate methodologies, for instance, the formulation of 
the criterion upon which the estimation is based as the action of a compact operator on 
some functional space. 

In this paper, wc obtain the second-order risk term over a large family of period esti- 
mators. We also obtain a lower bound for the second-order term, which is achieved by an 
estimator of the preceding family. This is the first result about second-order estimation 
in the period model. It provides a theoretical basis for the choice of the smoothing pa- 
rameters in the construction of the estimator. It also highlights the important role played 
by the nonparametric nuisance part of the model. For instance, it originally motivated 
the work of Castillo, Lcvy-Lcduc and Matias [3], where the authors study the problem 
of sharp adaptive estimation of / in model (1). 
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Structure of the paper. In Section 2, we construct a class of estimators using the penal- 
ized maximum likelihood method. Section 3 contains our main asymptotic results: we give 
the second-order properties of the preceding estimators and study the lower bound for 
the risk over all estimators. Moreover, we construct an estimator achieving the optimal 
second-order rate. Section 4 is devoted to technical proofs. 



2. Penalized maximum likelihood estimator 

Let us first introduce some useful notation. As the function / in (1) is 1-pcriodic, we 
may assume that / can be written as a convergent Fourier series, 

fix) = CfcC^'"''"^ = ai + \/2^ a2k cos{2nkx) + a-ik+i sin{2nkx) ^ OkSkix), (2) 

feeZ k>l k>l 

where ei{x) = 1 and, for fc > 1, e2k{x) = \/2cos(27tA:x) and S2k+i{x) — \/2sin(27Tfca;). 

The Fisher information for a continuously differentiable / in model (1) is, as T tends 
to infinity, 

It(/, e) = {l + 0(1)}^ ^(27rfc)^|c,p. (3) 

fcez 

Note that this quantity is asymptotically one quarter of the one given by Ibragimov and 
Has'minskii [12], page 209, since here, the observation interval is [— T/2,T/2] and not 
[0,r]. Also, note that it depends on both f and 9. To simplify the notation, we denote it 
by It in the sequel when we do not want to emphasize this dependence. 

We now construct a family of estimators of the period which allows us to deal with 
efficiency at the second order. We use the method of penalized maximum likelihood 
introduced by Dalalyan et al. [6]. Note that here, model (1) cannot be partitioned into a 
discrete collection of projected one-dimensional submodels since the space of projection 
would be L'^([0, 6]) and 9 is unknown. Thus the object of our study is the global likelihood 
function of the model. 

The likelihood for estimating 6 in model (1) depends on 9 and /, which is given by 
its Fourier coefficients (a^). To eliminate the nuisance parameters [ak), we first assume 
that the ak 's arc independent centered Gaussian random variables with variance a'j. and 
independent of the noise, that is, we put a prior distribution on (afc). To estimate the 
flfe's, wc maximize the posterior distribution of (ofe) given to the observations {x{t)}. 
Note that this is equivalent to the maximization of the joint likelihood of ({a;(<)}, {ak)) 
or of its logarithm, which, thanks to the Girsanov formula (sec, e.g., Ibragimov and 
Has'minskii [12], Appendix 2), is given by the following function $: 

T j 2 T 1^2, 2 

<I>[r,Mt)},(afc)] = -i / /(t/r)2dt+ / /(i/r) dx(t) - ^ ^. (4) 
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Taking the partial derivatives of the last quantity with respect to the Fourier coefficients 
flfe and using the approximation ^k{t/T)f(t/T) dt « OfcT, we find that the maximum 

is approximately obtained for (a^) such that 



(T + a-2)<.= 



T/2 



T/2 



Ekit/r) dx{t). 



Let /* be the function with Fourier coefficients (a^.). Then /57'/2 •^*(^/^) *-^*''(^) 
Ek>iiT + <^u^)al^ and /J/'^, f^it/rf dt ^T^,^, af. Thus 



$[r,Mt)},(4)]«^ 



k>l 



T + a: 



T/2 \ 2 

Ek{t/T)dx{t)) . 
-T/2 / 



For reasons of symmetry (for the minimax problem, we will assume that / lies in a 
Sobolev ellipsoid, see Section 3.3), for > 1, we impose the identity a2k — o'2k+i- This 
means that we put the same weights on sine and cosine for a given frequency. Hence, 
writing; for A: > 0, Afc = {T -\- (j2i.^i)~^T , which is in [0, 1], wc obtain the weighted criterion 



k>l '-1^ 



^2\k-ntlr 



(5) 



Note that we have dropped the first term in the sum since it does not depend on 9. 
Moreover, there is no restriction associated with using the Fourier basis; the preceding 
construction can be used with any orthonormal basis {Ek} of L^[0, 1]. 

Equation (5) can be seen as a weighted version of the estimator proposed by Golubev 
[9]. Note that the weight Xk is outside the square of the integral. We are thus introducing 
a weight on the 'energy' and not directly on the data, contrary to data tapers methods, 
which have been studied in the context of frequency estimation (among others) by Chen, 
Wu and Dahlhaus [4]. In fact, the results obtained by the two methods are of different 
natures. Wc also note from equation (4) that the estimation of the nuisance parameters 
flfc is done by an (approximate) penalized maximum likelihood method and thus the 
'Bayesian approach' leads, in fact, to penalization (see, e.g., Kimeldorf and Wahba [13]). 

We are now able to construct our estimator. As noted by Golubev [9] or Gassiat and 
Levy-Leduc [7], direct maximization of (5) would not allow us to distinguish the multiples 
of the unknown period. To avoid this problem, wc aim to take the smallest approximate 
minimizer of L(t) by choosing 

£t = jrGe, L(r) > (l-log^^/^T)supL(T)|, (6) 
I ree J 

ct = inf ^T- (7) 
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Now, let B{x, i?) = {t G 6, | ^ — 1| < i?}. We define our estimator as 



Argmax L(r). 

reB{eTS/4) 



(8) 



In order to understand the behavior of the criterion L(t). we introduce some useful 
notation. We define symmetrized weights over Z by letting A_fc — Xk and we set Aq = 0. 
The symbols ^j., X^fc^^O' 'l2k>o denote sums over Z, Z* and N, respectively. Then 



where 



L(T) = [r(r)+X(r) + vl/(r)]/2, 



X(t) = 2 ^ AfeT-i y e2"^'"*/^/(V6') dt / c-^i^^Vr ^^/(t), 



T/2 
T/2 



T/2 



(9) 



3. Second-order asymptotics in the period model 

Let us begin this section with some definitions. We say that a function is o(l) (resp., 
0(1)) if it tends to zero (resp., is bounded) as T goes to infinity. If {zk) is a sequence of 
complex numbers indexed by Z, then 



Zk\ 



1 = |zfc| and, form >1, 



:^|z,f(27tfc) 



2m 



We write simply ||z'|| instead of If u is a square-integrable function, ||u|| denotes 

its L^-norm on [0, 1]. In the sequel, C denotes a universal constant. 

3.1. Assumptions on the model 

Let us recall that = [aT,/3T]- We assume that as T tends to +oo, 

(PI) aj;i=0(T) (P2) /3T = 0(logr). 

Let us assume that / belongs to some class F = F{p,Co) of smooth functions whose 
Fourier coefficients (cfc) satisfy the Fourier expansion (2) and 



(Fl) |cip>p>0 (F2) ^(27tfc)4|cfe|2<Co<+oo. 
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We consider sequences of weights (Afc) such that Aq = 0, Ai = 1 and for all integers k, 
A_fc = Afc and < Afc < 1. We also assume that there exists Nt tending to +oo such that 

(WO) Afc = for k > Nt and ^ o{T) as T tends to +oo. 

Moreover, we assume that there exist positive constants pi and Ci such that 

(Wl) IIA'II >pilog^TmaxAfc(27tfc); 

k>l 



(W2) ^Afc(27tfc)''<CiT. 



Finally, we use the following technical assumption: 



(T) 



^(l-Afc)(27Tfc)2|cfcp 



1-^(1 -Afcf(27tfc) 



logT 



^'CfcP 



The o and O in the previous assumptions are meant to be uniform with respect to / and 
A. In the sequel, all a and O will be. Assumptions (Fl) and (F2) are regularity conditions 
on /. Assumptions (WO), (Wl) and (W2) are satisfied for quite a large variety of weight 
sequences. For instance, they are fulfilled for projection weights (l|fc|<7Vi,), provided that 
Nt>C log" T and = 0{T). 

Let us briefly study two consequences of the preceding assumptions. First, (Fl) and 
(F2) imply that there exists a constant h > — for example h = p/Co — such that for any 
integer p>2, 



Y.K,\'<ii-h)j2\c,\ 



(10) 



Second, since the weights are between and 1, we have (1 — Afc)^ < (1 ~ ^^fe) and thus 

n 2 



^(1 - Afc)(27tfc)2|cfcp X ^(1 - Afc)2(27Tfc)2|cfc|2 < 



^(1-Afc)(27tfc)2|cfcp 



L k 



Using (T), we obtain 



^(l-Afc)(27rfc)2|cfcp=o(log-iT). 



(11) 



In particular, note that for each fixed fc, the weight Afc tends to 1 as T tends to +oo. 



3.2. Second-order asymptotics for the risk 

Let us introduce the following hmctional, where the Cfc's are the Fourier coefficients of /. 

(12) 



Rrif, A) =J2{2nkr (1 - Afc)2|cfc|^ + -A^. 

k ^ 



1 
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This functional corresponds to a term of nonparametric estimation. In fact, suppose 
that wc have at hand a Gaussian sequence model defined by = cu +T'~^''^Cfej where 
Cfc = (l/\/2)(Ci,fc + jC2,fc) and {Ci,fc}, {C2,fc} are independent sequences of standard normal 
random variables. Denoting by / the function with Fourier coefficients Cfc, suppose we 
seek to estimate the derivative /' given by the Fourier coefficients cj, — 2\knck- Now, 
consider the linear estimator /' defined by the Fourier coefficients (cj.) = {Xk{2ik7i)yk). 
Then (12) is nothing but the quadratic risk E(||/' — /'|p). 

Theorem 1. Under assumptions (P), (F), (W) and (T), uniformly in f € F and in 
9 Cz Q, the estimator 9* defined by (8) satisfies, as T tends to +oo, 

E,./((r - 9fiT{f. e)) = i + {i + o(i)}M^. (13) 

Remark 1. The preceding theorem gives a second-order expansion for the risk. By 
(11), Sfe(27tfc)2(l-Afc)2|cfc|2=o(log-ir). Moreover, (WO) gives T-^T^ki'^^^fK < 
CT-^N^ = o(T-i/4). Hence, Rrif, A) = o(l) as T tends to +oo. 

Remark 2. The second-order term j|/'||^^i?T(/, A) is the same as the one obtained by 
Dalalyan et al. [6] and similar to the one obtained by Golubev and Hardlc [10]. It seems 
to be a general feature of smooth semi-parametric models that the term of order 2 reflects 
the estimation of the score function of the model. 



Remark 3. In this paper, we consider model (1) on the centered interval [— T/2,T/2]. 
It can be checked that for a different interval of length T, for example [0, T] , equation 
(13) still holds. 

Proof of Theorem 1. Here, we give the main lines of the proof, the technical aspects 
being dealt with in Section 4. A transversal object is the criterion L on which the defi- 
nition of 9* relies. We study its deterministic part F in Section 4.1 by means of Fourier 
techniques. In Section 4.2, we study the stochastic parts X and \1/ by introducing a 
wcll-chosen operator on L^[— r/2,T/2]. 

The first step is to obtain consistency results for 9* . Let us define the event 

A=|sup|X(r) + vI/(r)|<T3/4|. (14) 

The complement of Ao has probability o{T~p) for any integer p, by Lemma 6. 

Due to Lemma 1, with = T^^^, we have sup^^ee !"(''') < T{'^j._^^ |cj,p + 0(7^^^^)}. 

Thus, on the event Aq, we have sup^geL(T) < ^{Efe^^o |cfcP + 0(7^^^^) -}- r-i/4}/2. 

On the other hand, again using Lemma 1 and the definition of ^0, we have, on ^0, 
2L{9) > T{J2k=io IcfcP + o(log^^ T) - T"!/"}. Thus 9 & £t on Aq for suflrciently large T. 
For any integers p>2 and j > 1, bounding the weights from above by 1 and using (10), 
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we obtain A,, jc^ppT < Y.^^^ \cqp?T <(l-h) Y.^^^ \c,\'T. Hence, with et = T-^'\ 
we have £t C Uj>i B{j9,eT) on Ao- Therefore, er & B{9,eT) on Ao- 
The definition of 9* then impHes that 9* ^ B{29,eT) on y^o, thus 

9*eB{9,eT) onA- 

In fact, we need to refine this rate. For any D > 0, let us define the event Ai as 

Ai = {\9* - 9\T^/^9^^ < Dlogi/^T}. (15) 

Lemma 11 estabhshes that for every integer p, for sufficiently large D, P(-4i) is o{T^p). 

The next step is to introduce the variable r, which is a theoretical tool for our proof 
(of course, it is not an estimator since it requires knowledge of 9): 

L'{9) + i?-9)E{L"{9))^0. (16) 

By Lemma 12, r has the desired second-order expansion. It remains to show that 9* and 
r are sufficiently close, that 9* has the same expansion as r at order 2. 

E((r - 9flT) = E((9* - 9)^It1aJ + E((r - 9)Ht1ai)- 

Using (PI), (P2) and (43), wc arrive at the conclusion that 

E((r - 9)Ht1ai) < CPl^a^^T^PiAl), 

which is negligible compared to a given power of T if D is chosen sufficiently large. Now, 

E{{9* - 9flTlA,) 

= E((f - 9)'It) + E([(r - ?)2 + 2(9* 9)]It IaJ - E((f - 9)'1t1ai)- 

The first term in the above expression gives the appropriate expansion, by Lemma 12, 
while the last one is negligible due to (43), (PI) and (P2). The middle term must still 
be dealt with and this is carried out by Lemma 13. Hence, the expansions of the risk for 
9* and r are the same, which proves Theorem 1. □ 



3.3. Lower bound for the risk 

In this section, we establish a lower bound, in the minimax sense, for the second-order 
term over all possible estimators of 9. For any /3 > 0, L > 0, let us define 

The expression of the second-order term in Theorem 1 suggests minimizing the functional 
Rrif, A). The behavior of this functional is well understood; see Pinsker [15] or Tsybakov 
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[17], Chapter 3 for a complete overview of the subject. There exists a function s in W/3,l 
and a sequence q in P such that {s,q) is a saddle point of A) over Wp^L x ^^(Z). 

We have 



rr = i?T(s, q) = inf sup Rrif, A) = ^ V(27tfc)^ 



(17) 



We also have explicit expressions for s and q in terms of the solution Wt of the equation 

/3-1 



fcez* 



vifciy 



1 



{2n\k\) 



2/3 



(18) 



Let g he a function in F(p,Co). Let us denote by gk its Fourier coefficients. For any 
6 > and fixed /3 > 0, L > 0, we define a neighborhood of g as follows: 



Fs{g)^{f = g + v,\\v\\<S,veWp,L}. 



Theorem 2. Suppose that (3>2 and L> 0. For any St ~^ such that there exists a > 
with S^T/W^^" — *■ +00, as T tends to +oo, 

inf sup Egj{(e~9)^lT{ej))>l + {l + o{l)}^, 

where the infimum is taken over all estimators 9 based on the observations {x{t)} and 
where rx is defined by (17). 

Proof. The first step is to change variables so that, contrary to (3), the Fisher informa- 
tion will no longer depend on the parameter of interest. Let us define uj = 9~^. It is not 
hard to check that the Fisher information Jt(/) for estimating u; in model (1) equals 

Mf) = oHrie, /) = {1 + o(i)}r3||/'|| 2/12 

and, in particular, up to the o(l) term, does not depend on 9. Since the mapping 9^9/9'^ 
between the set of estimators of 9 and the set of estimators of lj is one to one, we obtain 

inf sup ^ej{{e^OflT{ej)) = ^ni sup ^^,f{{uo ~ ujf JtU)), 

where the last infimum is taken over all estimators lo based on the observations {x(t)} 
and where the supremum is taken over uj g [P^^,a'^^]. 

The second stop is to bound the minimax risk from below by a well-chosen Baycs risk. 
Let us define a prior on / putting a prior on each Fourier coefficient of /, concentrating 
around the Fourier coefficient gk of g in the following way: 

fk = gk + <yk{(k,i + i^k,2}, 
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where for p=l,2, ^k,p ^ M{0, 1/2) are mdependent and independent of and 

'0, \k\<jTWT, 
(l-7T)|sfe|2, \k\>jTWT, 

where jt = 1/logT and the s^'s are the Fourier coefficients of the function defined in 
(17). We then choose as prior for uj a distribution with density tt on [uttPt], where 
TT vanishes at the cndpoints of the interval and such that the Fisher information J^^ = 
J TY'{uj)^TT~^{uj)dx is finite. 

We denote by ^'(t(/) the distribution associated with the preceding prior on /. We 
then let 

Jt= / JT{f)d^M)^n + o{l)}^Y.^2nk)^{\g,\^ + 2al). 

•' k 

In the sequel, we shall denote by E the expectation in the full Bayesian model. 

For the preceding choice of cr^ , the random function / is close to g with high probability. 
More precisely, one can check (as in Dalalyan et al. [6]) that P(Fay(5)'^) decreases at an 
exponential rate. Moreover, it is also not difficult to check that with this choice of ct^, 
we have 

|12JTr-3-||.g'||2|^o(l). (20) 

2 

r-i^(27Tfc)V = {l + o(l)}rr, withA^fc= f'; (21) 

k ^ +'^k 

Let us bound the minimax risk below using the full Bayesian model: 

inf sup E^j{{Cu-ujfjT{f))>ME({^~^)'J^iFs,ig){f)) (I) 

- supE((ci - u:f{-J^-JT{f))lFs,(g)). (II) 

Cj 

The outline of the proof of the theorem is then as follows. We bound (I) from below: 
(I) > inf JTE((ci - uf) - Cl3^T^FiFs^ (g)^). 

The second term in this difference is negligible, due to the exponential bound on 
P {Fgr,, (g)'^) ■ Let us denote by J7t(w) the Fisher information associated with the ob- 
servations {x{t)} in the Bayesian model with respect to / with fixed uj. The Van Trees 
inequality (see Gill and Levit [8]) applied to the full Bayesian model gives 

infE((cj-w)2) > ^ 



J JT{cj)TT{duj) + 



The key point of the proof is then to obtain an expansion of J't{uj)- Note that as 
the model cannot be partitioned into one-dimensional submodels, the Fisher information 
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Jri^^) is not a sum of Fisher information over submodels and must thus has to be handled 
globally. As T tends to +00, with the cr^'s defined by (19), we have 

JT(^) = JT-{l + o(l)}^^(27tfc)Vfc. (22) 

k 

The proof of (22) is not difficult, but it is technical. For detailed calculations, we refer 
the reader to Castillo [2] . 

Using equation (22), we may deduce 

inf JtE((cj-cj)2) > 



Jt-{1 + o(1)}(TV12) ^ J27TA:) Vfc + J. 
Now, using (20) and (21), we obtain 

inf JtE((w-w)2)> 

^ l-|l + o(l)}||g'|| ^(rT -12r -^J^) 

Using the fact that T^^ = o{rT) and the inequality (1 + x)^^ > 1 — a:, valid for x > —1, 

inf JTE((w-tj)2) > l + {l + o(l)}||,g'||-VT. 

Finally, (II) is negligible with respect to (I), as in Dalalyan et al. [6]. □ 



Remark 4 • Changing variables works well in our framework since the Fisher information 
(3) has a quite simple separated form in / and 6. This might not be the case for other 
models. For instance, in the Cox model, the form of the efficient Fisher information as 
given in van der Vaart [18], page 416 is more involved and a change of variables does 
not seem to make the parameter of interest vanish. We think that this difficulty could 
be overcome by also carefully choosing a prior on the 0-parametcr. 



3.4. Achieving the lower bound 

Theorem 3. Assume that the conditions of Theorem 2 are fulfilled and that there is 
some p> (3>2 such that the sum ^ |27Tfcp^|(7fcp is finite. Then there exists a sequence 
of weights A* such that the corresponding estimator 9{\*) defined by (8) achieves the 
hound of Theorem 2. That is, as T tends to +00, 

sup Ee,/((0(A*) - eflT{6J)) = 1 + {1 + o{l)]\\g'r\T. 

Let us comment on the rate of convergence for the second-order term obtained above. 
Pinsker's theory (see Pinsker [15]) states that, up to a constant which can be computed in 
terms of (3 and L, in [17] is of the order r(2-20)/(20+i)_ p^j. example, in the case /3 = 2, 
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the optimal second-order term is of the order T^^/^. This semi-parametric rate is fairly 
slow compared to the first-order rate, which is, up to a constant, T~'^/^, highlighting 
the importance of second-order terms in semi-parametric estimation. Moreover, the less 
regular the function, the more significant the second-order term. 

Proof of Theorem 3. The idea is to use slightly modified Pinsker weights for A* by 
letting 



X* _ J 1, \k\<lTWT, 

^ [\-{\k\/WT)^-^] + , |fc|>7TWr, ^ ' 



where Wt satisfies (18). ft is not difficult to check that these weights satisfy assumptions 
(W) and (T). Due to Theorem 1, it suffices to check that the suprcmum of Rrif, A*) for 
/ in the vanishing neighborhood under consideration is indeed equivalent to r^. There 
is no difficulty with respect to Dalalyan et al. [6], so this argument is omitted. □ 

4. Proof of Theorem 1 

Definition 1. We say that the probability of a measurable set A is negligible if for all 
integers p, we have 'P{A{K)) ~ o{T^p) as T tends to +00. 



4.1. Behavior of the deterministic part T 



Let us denote by (f> the Fourier transform of the indicator function of [—1/2, 1/2]: (j){x) 



1/2 „2i7t2;t 



/2' 



dt = sin(7Tx)/(7ta;). In the sequel, we use the following bounds on (jj. For any 
p GN, there exist constants Mp > 0, depending only on p, such that 

\4>'^P\u)\ < Mp for ue [-1,1], \$^P\u)\<Mp/\u\ for ]u] > 1/4. (24) 

\(p'{u)\<Ci\u\ forallueM, \i'^^Hu)\ < C2\u\ for all u e R. (25) 

For any real number x, let us denote by A{x) its distance to Z, and by ]x[ the (smallest) 
integer realizing this distance. Let us also introduce the auxiliary notation 



Using (2), we obtain 

r(r)=^AfeT 

k 

r(r) = ^A,r 



T 



ci4>{akj) 



(26) 
(27) 
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For any p,j € Z, we denote hy p A j the greatest common divisor of p and j. 
Lemma 1. Let -fT = T^^^ and Et = T^^/^. Then, as T tends to +00, 



rir)=oij^'/'T), 



r(T) < <^Xqj\Cqp 
I 9 



^fr^ U B{je/p,eT), 

j>l,0<p<7T 



(28) 



\cqp\''+oi-/r'^^)}T, tfTeB{j9/p,eT) {P<-/T,P A J = l), (29) 



m 



o(iog-ir) T. 



(30) 



Proof. First, note that, without restriction, we can just study the sum over \l\ < jt 
in (26). Indeed, using the Cauchy-Schwarz inequahty, ||A||i < Nt = o(T^/**) and (F2), 
we have 



\1\>1T 



i^ 

\1\>1T |'|>7t 



(31) 



In the remainder of the proof, p and j are two relatively prime integers such that p <jt- 
Also, note that by symmetry, we can always assume that k is positive. 

Assume that r is not in a ball B{j6 /p^et) with p < 'Jt and j > 1. For any / such 
that \l\ < 7t and any integer fc, this implies that |afc_i| > T(3^^et > 1. By (24), we have 
\<i){ak,i)? < PtT-^e:^^. Using (26) and (31), this yields r(r) = o{-/~^^^T). 

Assume that t is in a ball B{j9/p,ET) with p < jt and p Aj = 1. In this case, we 
observe that the sum over l y^]k9/T[ in (27) is negligible, due to the triangle inequality, 
(F2) and (24). There are then two cases for the integer k in (27): 

• There exists an integer q such that k = qj. Then, for sufficiently large T, ]k6/T[ = pq 
since 

\k9/T — pq\ < pqET < JtNtEt = o(l)- 

• The integer k is not a multiple of j . Then k = qj + r with r < j . Note that 

^ , "^P^T . „ . ^ItNtEt 

< pqET H < CpqET < C . 

J 3 

Since pAj = l, k9/T is at a distance to the integers larger than l/2j. Since r must 
lie in 9, we have j9/p < 2/3t- Thus, j'^ > 9/{2pPt)- Therefore, 



k9 f 




--[pq^ 


-1) 







> 



T 



which allows us to prove that the corresponding term is negligible, using (24). 
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Finally, writing (27) at t = 9, it is easy to check, by similar arguments, that 
T{e) = J2 AfelcfepT + o{{i3tT-^ + NTf3^T-^)T}. 

k 

Moreover, due to (11), Emo(1 - ^k)\ck? < - Xk){2nk)^\ck\^ = o(log-ir), which 

establishes (30). □ 

Let us define the following weighted Fisher information: 

= I(A') = ^E^'(27tfc)2|c,p. (32) 

fe fe 

Lemma 2. As T tends to +oo, 

T'{ef ^T^9-^o{\\x'fT-^), r"{e) = -2i{x) + T^e-'^o{\\x'fT~^). 

Proof. For any r in 8, taking the derivative with respect to r in (9), 

r'(r) = — ^Xkk^c^ci{4)'{ak,i)(i){ak,p) + (i)'iak,p)(i)iak,i)). (33) 
''' k p,l 

Now, consider the case t = 9 and write the sum over p, I in (33) as 



E +1 E + E 1+ E =«+^- 

p,/ p—k,l—k \ p—k,l^k p^k.l — k ) p^k,l^k 



(34) 



Then note that a = 0, \b\ < \ck\Y.p\cp\OT-^ and |c| < {Y.p\cp\)^S^T-^ ■ Hence, r'{0) = 
0{T/9), which, using (P2) and (Wl), gives the first expansion. The expansion for T"{6) 
is obtained similarly. □ 

Let us define the set Vai as 

Va, = {t e e, \t-9\< DT-'^/^9^ log^/^ T}. (35) 

Lemma 3. As T tends to +oo, uniformly in t €z Vai7 we have 

r'{T)=0{9-'^T^/^log^/^T), T"{t)=0{9-'^T^), T'-^Ht) ^ o{9~^T^). 

Proof. Using (25) and the fact that T lies in V^i , |0'(afc,fc)| <Ci\ak,k \ <CkT-^^^\og^/^T. 
We again write the sum (33) as in (34). By to (25) and (F2), the term corresponding 
to T,p=k=i is bounded above by CT^/^9-^ \og^/^ TJ2 Xkk^\ck\^ < \og^^^ T. 

Similarly, one concludes that the term J2p=k i^k + '^p=ik i=k is bounded above by 
C{T9-^ + T^/^9-^ log^/^ T) and that the term Y.p^k,i^k is bounded above by Xkk < 
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7V|,, which together with (P2) and (WO), gives the result. The other two expansions are 
obtained similarly. □ 

4.2. Behavior of the stochastic parts X and ^ 

Let KJp be the operator on L2([-T/2, r/2]) such that for any g in L'^([-T/2,T/2]), 
IKt.9W=/ < VAfcC^'^'^^*-")/^ i.9(w)du= / Klj.[t - u)g{u)Axi. 

J-T/2 [ J J-Tjl 

Then X(t) is nothing but 2 jK5^/(-/^)(i)dM^(t). Thus, X and its derivative are centered 
Gaussian random variables of variance 



E(x(pH^)') =4 f^^ |[K5,/(./0)]('')(O|'dt. (36) 

J-T/2 



On the other hand, note that for every T and r, the operator KJ, acting on L^[— r/2, T/2] 
is self-adjoint and compact. Namely, it is a convolution operator with bounded kernel 
KTp. The spectral theorem (see, e.g., Yoshida [19]) then states that it can be characterized 
by eigenvalues {f3k}k£z an orthonormal basis of eigenvectors {vk}ki£z- 

Using expansions over the basis {vk}, one can check that there exists a sequence 
{ak}k<£z of independent A/'(0, 1) random variables such that 



rT/2 i-T/2 
'T/2 J ~T/2 



*(t)= / / K-T{t-u)dW{u)dW{t) = y^l3kal. (37) 

J ~T/2J ~T/2 u 



This also holds for the derivatives of replacing the kernel by its derivatives. Let 
us denote by (3^'^ the eigenvalues of K^''^''. It then follows from (37) that for p > 0, 



E[M/(rf(r)]=^/3i^)=/ A-(^)(0)dt, (38) 
Var[*(P)(T)] =2V/3{^'')' = 2 / / |Ay'^)(i - w)|' dt du, (39) 

J-T/2 J-T/2 



where Var denotes the variance. These formulas allow the deviations of the process 'I''-'') 
to be controlled through the study of its Laplace transform. 

Lemma 4. There exists C > such that, for all t £ 9, E(x(r)2) < CT. 
Proof. Let jk = J2i'^iH<^k,i)- Then 



E(X(r)2) ^AtJ 



1/2 
1/2 



^ Afe ^ ci4){akj)cxp{2ikntT/ t} 



I 



dt. 
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E(x(r)2) 4r^A2|7,,|2 +4r ^ AfeAfc-7feWO(/3Tr-i), 

k k^k' 



where we used the fact that J^y2 exp{2i(fc — k')ntT/T} dt = 1^=^' + lk^k'0{(3TT ^) for 
all integers fc,/c'. The first term in the last sum is similar to T{t) (see (26), with 
replacing A^) and thus it is a OCT) uniformly in r thanks to Lemma 1. The second term 
is 0{\\\\\\I3tT~^) , which proves the lemma. □ 

Lemma 5. For all t CzQ, for all positive integers p, 

m^'Hr)') < CT2J'+ir-4J'^A2(27rA:)2p(|c],,/,[|VO(/32T-2)), 

k 

E(^'(f)(T)2) < CT'^PT~'^P\\X^P^f. 

Proof. Defining ^fe(T) = c;0(afe^;(T))}(P^ , the derivative being with respect to r, we 
have 



/•I, 



1/2 
/2 



^A,e™/-a- 



dt 



< AT 



k^m 



L k 

2|C \\2 



Since | Efe^m ^fe^m^?™! < {Y.kK\^k\Y < ^rEfe-^felCfeP' by the Cauchy-Schwarz in- 
equality, 

E(x(^)(r)2)<CT^A^|e,p. 

k 

One now checks that the predominant term coming from the pth derivative while evalu- 
ating is TPT-^PkP$^P\ak,i). Hence, 

\^k\^<CT'pe~^pepJ2h4>^''\ak,i)f 



< CT^PT~^P{2nk) 



2p 



|c]fce/T 



which proves the result for X*-^^- 

To bound ^^p'^ , we use (39). Again, one checks that when evaluating the pth derivative, 
there is one dominating term, which is 

E(^'(p)(r)2) < CT-V-''PEAfcAi(27r)2p(fc/)P 

k,l 
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i-T/2 

X / / {t-u)^>'cxp{2in{t-u){k-l)/T}dudt 



'-T/2 J-T/2 

< CT-V-4p^A^.(27T/c)2PO(r2p+2)_ 



□ 



Remark 5. In the sequel, we use Lemma 5 for neighborhoods V oi 9 such that, for any 
r e V and any integer k such that |A:| < Nt, we have ]k9/T[ — k. 



Lemma 6. For any integer p, as T tends to +oo, 



P sup|x(t) + 1'(t)| >fog^T 
\ree 



o{T-P). 



Proof. The proof is not difhcuh, using Lemma 14 (as in Golubev [9]), and is thus 
omitted. □ 

Lemma 7. If^X^) is defined by (32), then as T tends to +oo, 
E(X'(6I)2) = 4I(A2) +T^e-^o{\\X'fT-^). 

Proof. Let ys =J2iCi{2m)${bi^s) and Zs ^ -Y.i'^i4>'{k,s)- Then 

.1/2 



T-36»4e(x'(61)2) = 4 / 

J -1/2 



\kk{tyk - Zk) exp{2i7t/ciT/6l} 



di 



= 4^A^fc2(|y,|2/i2+|z,|2) 

k 

+ 0{eT^^)Y Xk\pkp{\yky^\ + \yk'z^\ + \y^Zk\ + \zkz^\). 
k^p 

Now, note that y^ = 2mcs + O{0T-^) and Zs = O{0T-^). This yields the resuh using, for 
the remainder term, the fact that, by the Cauchy-Schwarz inequality and (F2), we have 
Ek Afcfc'Icfel < C|1A|1 and Xkk\ck\ <C. □ 

Lemma 8. As T tends to +oo, 

E(^'(0)2) = T^-^{1 + o(l)} ^(27T/fc)2A^/3. 



Proof. Using (36), we obtain 
E(*'(6l)2) ^STt^T^ei-"^ VfcpAfcAp /" ( c-x:p{2i'n{t~u){k~p)/e){t~uf&tdu, 

J-T/2 J~T/2 



k.p 
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Lemma 9. // we recall that AAt ~ max/j Xk{2nk), then the quantities 

p(^sup|x"(t)| >T5/2iogy^-4^^ P(^sup|^"(r)| >T5/2logTr'^), 



P{sup\x'^^\T)\>T'^/^\ogTMTO' 



f sup 



are negligible in the sense of Definition 1. Moreover, 



E( sup|x'''(7 



:O(r»0-^"), E(sup 



1 ' 



Proof. The proof is standard, using Lemmas 14 and 5, and is thus omitted. 



□ 



4.3. Conclusion of the proof 

Lemma 10. Let fir = T^^^^ and B = B{e,eT-^ ^f}) . Then, for any t e B , we have 

r(r) - T{e) = -{t- ef I(A) +o{(r - ef i(A) + (r - l(A)i/2}, (40) 

X(r) - X(e) = 2(t - 6){l + 0(1)} I(A2)^/V+ (r - 0)7^l(r), (41) 
*(r)-^'(^^) = (r-^^)7^2(T), (42) 

where Af is an A/'(0, 1) random variable and for i = 1, 2, the process TZi satisfies 

3Ci,C2 >0,Va;e [0,logT] P ^sup |7^,| > xl^^^^ < (1 + Ci/^Ta;) exp(-C2x2). 
Proof. To prove (40), note that 



l^k 



Ck 

l^k 



Y,XkT\ck\'mak,k)\'^l} 



a + /3 + 7, 



^ci4){ak,, 

l^k 



l^k 



+ 7 
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where 7 regroups the crossed terms coming from each of the squares. But for all real u, 
(j){u) = 1 — + u^ipiu) with ij} bounded on R, so 



a : 



^(^,3^-2^-2/^2) ^ Afc(27Tfc)2|cfc|2(r - ef{l + fcT(r-i - 0-^)0(1)}. 



Now, note that by (Fl), the rate of 1(A) is a constant times T^O^'^. Using the fact that 
^ /c'^jcfep is finite and the fact that r lies in S, we obtain a = — l(A)(r — 6')^{1 + o(l)}. 
To bound 1/31, we use the inequality ||a|2 - < |a- 6|(|a| + |6|) and (24): 



l^k 



I 



sup 



\ak,k\<C{2e/T)\k\\l^e/T\T/9, 



Y,hmM\ + \khm<ci29/T). 

l^k 

Thus, |/3| is bounded from above by CJ2k>'k\k\\T - 8] < CN^\t - 6], which is o((r - 
9)l{Xy^^)- The same holds for 7, similarly. 

Note that by means of a Taylor expansion, there can be seen to exist random reals c 
and d such that: 

X(t) - X{e) = X'(0)(t - 0) + X"(c)(r - 0)2/2, 
*(t)-1'(6I) = *'(d)(T-6i), 

where the distribution of X'{0) is 7V(0, E(x'(6')2)) and E{x'{9)'^) is given by Lemma 7. 

Let us now control the deviations of X"(c). It suffices to control the supremum of 
X"(t) for r in B. Note that for any 7 > 0, 

E(cxp{27X"(r)}) = exp{272E(x"(r)2)}. 

Using Lemma 5 and the definition of B, we obtain that, for any y > and 7 > 0, 

P (^sup I X"(r)| >y^<2 cxp{^jy + CT'^O-'^j^) (^1 + jCT'^/^ 

< 2 exp(-72; + CT'^O-^ + -^CT^'^ Nt9-'^ n:^^). 

Letting y ~ i.itT^^'^0~^x and 7 ~ ■qT^^^^O'^x with sufficiently small 77, we obtain the 
desired bound for TZi. 

Finally, we control the deviations of 'f' on B. For any real 7 such that 7"-'^ > 8 sup^ (5^'^ , 
using (37) and the inequality — log(l — m) < it + 11^ for u < 1/2, we have 

E(exp{27*'(T)}) = expJ - ^ log(l - 4jPI^^)/2 I < exp J 27^/3(1) + 87' ^ /^^'^ ' \ ■ 
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Then, using (38), YpPp^ Using (39) and Lemma 5, YpPv^ <T'^t~^\\\^^^\\'^ . In 

particular, supp/Jp^-* < Cr6'~^||A^^^||. To conclude, wc apply Lemma 14 with 7 = ^^Ij^^'^x 
and sufhciently small 77. □ 

Lemma 11. As T tends to +00, for any integer p and sufficiently large D, 

P{\g* ^ 0\T^/^0-^ > Dlog^/^ T) = o{T-P). (43) 

Proof. We proceed in two steps. Recall that from Section 3, we have 9* e B{9,eT) on 
Ao, where P(ylg) is negligible. We first show that P{e* € B{0,eT-^ fif,^)") is negligible. 
Note that if y is a neighborhood of 6 such that ]k6/T[^ k for all |fc| < Nt and r in V 
(e.g., V = B^O.et)), we have 

r(r) - m = ^ - I) + O (tNt^^ . 

Let us define C = 5(6*, et) n 5(6*, j^)^ We next prove that P{e* e C) is negligible. Note 

that for C = D = tt? and any u e R, we have |(^(w)P - 1 < -~Cu^{D + ii^)"\ thus 

for any r e C , 

r(T) - m < -T^ A,|cfcp-^-J-^ + OiNrPT) 
<-Tiif?C Xkk\k? + 0{NTl3T) 

\k\ <fiT 

< -CT^i:f? since h^(3tNt = o{T). 
Recall that 9* is defined by (8) and let = X + Then 



P(r e C) < V{6 i B{eT, 1/4)) + P \h{e) < supL(r) j , 

P{e* e C) - PiA'o) < P ( r(e') < supr(r) + 2sup |r/(r)| ) < P ( 2 sup |r/(r)| > CT^i;^^ 
\ c e / V e 

Using Lemma 6, we conclude that P{9* G B{d, j^Y) is negligible. 

Letting A' = {9* e B{9, ^)} and = { >\t-9\> xl~^^'^} for a; > 0, we have 

P(|r-6l|4/^>a;)<p( sup L(r) > L(6l) ) < P ( supL(r) - L(6l) > ) + P(y^'^). 

\\t-9\>x1-^/^ /Vs. / 

Now, write L(t) - L{9) = T{t) - T{9) + X(t) - X(6') + «'(t) - ^'(6'). Note that for suf- 
ficiently large T, by (40), we have r(T) - T{9) < -(r - 6')2 l(A)/2 for any r g E^. Then 
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using expansions (41) and (42), we obtain 

p{\e* ^9\i}/^ >x) 

< P f sup(T - 0)[-i(r - 9) 1(A) + 2I(A2)'/ V + (Tei + Tea)] > ) + P(^") 



<p(x<C7V) + p( x<i:r^^^siip\ni+n2\ ] +p{A' 



To conclude, we use the standard bound for the tail of a Gaussian random variable and 
Lemma 10, and we set x = D log^^^ T with sufficiently large D. □ 



Lemma 12. As T tends to +oo, 



^{{9-eflT)^l + {l + o{l)} 



Rt[LX) 



Proof. The definition of T implies that E((r- 6')2It)[E(L"(6I))]2 = E(L'(6')2)l7.. To com- 
pute the expectations, note that (38) is zero for p > 1 and that 'X!{9)'^'{9) has zero mean 
since it is a product of an uneven number of stochastic integrals with respect to dW{t). 
Thus, 4[E(L"(6l))]2 = T"{eY and 4E(L'(6')2) = T'{ef + E(x'(6l)2) + E(«''(6')2). 
Using Lemmas 7, 8 and 2, we can write the following expansion of the risk for r: 



rp3 



E((r- 



|A'i 



/||2 



1 + \\f\r-lj2(^l l)(27tfc)2|c,|2 + 1 ^ A2(27rA:)2 + o 

{ k k 

i+\\r\rj2i^k~i){2nkr\c,\'+ol^^^ 



Using (11), we can expand the denominator. Then using the expansion of (1 — e) ^ 
around e = and the fact that, due to (T), J^ki^ - Afc)(27Tfc)2|cfcp = o(i?T(/, A)^/^), we 
obtain 



E((r- 



1 + 11/ 



'II -2 



^(A^ - l)(27rfc)2|c,p + 1 ^ A2(27rA:)2 + a 



|A'| 



'l|2 



1 - 2\\fT''J2(^k - l)(27tfc)2|cfcp +o(i?T(/, A)) 
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E((f - 0)2It) = 1 + ll/'ir^ Y.^2nkf {{I \,f\cu\^ + ^Xl^ + o(i?T(/, A)). ^ 

Lemma 13. Let Rt be the functional defined by (12). Then, as T tends to +oo, 

E((r - ?)Ht l^J = o(i?T(/, A)T-i), (44) 
E((r-?)(f-0)lTl^J-o(i?T(/,A)T-i). (45) 

Proof. Note that by Lemma 1 combined with Lemma 6, the criterion L admits on ^ 
local maximum inside the ball B{eT, 1/4). Thus, L'(0*) = on this event. Using a Taylor 
expansion of L(t), there exists lo S \0, 0*] such that 

L'(r) = = L'(6') + (r - e')L"(e') + \{e* - efi.^^\Lo). 

Using (16), we deduce that 

{e* - f)E(L"(6')) = {9- e*){V'{0) - E(L"(6i))} - \{e - r)2L(3)(t^). (46) 
Note that using Lemmas 3 and 9, 



E 


sup|L(3)(c^)|2 


<E 


sup|r(3)|2 4 


.|X(3)|2 + |VI,(3)|2 






Lv^, J 











Notethat on A, [9* -Of < D'^d*T-^logT. Moreover, thanks to Lemma 2, E(L"(6'))"2 ^ 
4E(r"(6'))-2 < (CT^0-i)-^. Thanks to Lemma 5, 

E{L"(6') - E(L"(6i))}2 < E(X"(6I)2) + E(*"(6i)2) < CT^e~\ 

From the preceding inequalities, we conclude that 

E((r - t)Ht1a,) < C(logT)5/2r-i < C(logr)-3/2r-i||A'||2 = oiRrif, A)), 

by (Wl), which proves (44). Finally, (45) is proved by similar arguments. □ 

Lemma 14. Let (Xt) be a stochastic process differentiable a.s., fi and x positive real 
numbers and I an interval of^. We then have 

P(supXr>a:) <exp(-Aia;)sup(Eexp(2/iXr))^/^(l+Ai / (E| x'^ H^^^ dr ) . 

\re/ / re/ V Jrel J 

Proof. The proof of this lemma can be found in Golubev [9] . □ 
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